Abstract

In this paper, we consider the one-term Edgeworth expansion for finite population L-statistics.
We provide an explicit formula for the Edgeworth correction term and give sufficient conditions
for the validity of the expansion, which are expressed in terms of the weight function that defines
the statistic and moment conditions.
1 Introduction

Consider a population $\mathcal{X} = \{x_1, \dots, x_N\}$ of size $N$. We assume, without loss of generality, that
$x_1 \le \dots \le x_N$. Let $\mathbb{X} = \{X_1, \dots, X_n\}$ be a simple random sample of size $n < N$ drawn without
replacement from $\mathcal{X}$. Let $X_{1:n} \le \dots \le X_{n:n}$ be the order statistics of $\mathbb{X}$. Let $c_1, \dots, c_n$ be a sequence
of real numbers. Define the L-statistic

$$L_n = L_n(\mathbb{X}) = \frac{1}{n} \sum_{j=1}^{n} c_j X_{j:n}.$$

Note that $L_n$ is a symmetric statistic, i.e., for every permutation of $\mathbb{X}$ the value of $L_n$ remains the
same.
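The definition above can be illustrated with a minimal sketch. The helper name `l_statistic`, the normal toy population and the sizes used here are assumptions made only for illustration; they do not come from the paper.

```python
import random


def l_statistic(sample, weights):
    """Return L_n = (1/n) * sum_j c_j * X_{j:n} for one sample."""
    n = len(sample)
    if len(weights) != n:
        raise ValueError("need exactly one weight c_j per observation")
    order_stats = sorted(sample)  # X_{1:n} <= ... <= X_{n:n}
    return sum(c * x for c, x in zip(weights, order_stats)) / n


# Hypothetical population of size N = 20 (illustration only).
rng = random.Random(0)
population = [rng.gauss(0.0, 1.0) for _ in range(20)]

# Simple random sample of size n = 5, drawn without replacement.
sample = rng.sample(population, 5)

# The weights c_j = 1 turn L_n into the sample mean.
mean_value = l_statistic(sample, [1.0] * 5)
```

Because the sample is sorted before the weights are applied, the value does not depend on the order in which the observations are listed, which is the symmetry property noted above.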

In the case where the random variables $X_1, \dots, X_n$ are independent and identically distributed (i.i.d.),
L-statistics were studied by a number of authors. Asymptotic normality under various conditions was
shown in Chernoff et al. [6], Shorack [13] and Stigler [14], among others. Berry-Esseen bounds were
obtained in Bjerve [3], Helmers [7], and others. Later such bounds were generalized to symmetric
statistics in van Zwet [15]. Large deviations for L-statistics were considered in Bentkus and Zitikis
[2]. Edgeworth expansions for L-statistics were established in Helmers [8] and Bentkus et al. [1]; see also
Putter [10]. A general second order asymptotic theory of symmetric asymptotically linear statistics
was developed in [1]. Empirical Edgeworth expansions for symmetric statistics were considered in
Putter and van Zwet [11].

In the case of samples drawn without replacement from a finite population, considerably less is
known about L-statistics. For the asymptotic normality of L-statistics we refer to Shao [12].
The one-term Edgeworth expansion for general asymptotically linear symmetric statistics, based on
samples drawn without replacement, is obtained in Bloznelis and Götze [5]. Empirical Edgeworth
expansions for symmetric finite population statistics are constructed in Bloznelis [4].
Our analysis is based on the Hoeffding decomposition

$$L_n = \mathbf{E} L_n + U_1 + \dots + U_n, \qquad (1)$$

where

$$U_m = \sum_{1 \le i_1 < \dots < i_m \le n} g_m(X_{i_1}, \dots, X_{i_m}), \qquad m = 1, \dots, n.$$

Here the symmetric and centered kernels $g_m$, $m = 1, \dots, n$, are certain linear combinations of the conditional
expectations

$$h_j(x_{k_1}, \dots, x_{k_j}) = \mathbf{E}(L_n - \mathbf{E} L_n \mid X_1 = x_{k_1}, \dots, X_j = x_{k_j}), \qquad 1 \le j \le m,$$

such that the $U_m$ are mutually uncorrelated. The decomposition in (1) is also called an orthogonal
decomposition of $L_n$. For U-statistics, based on i.i.d. observations, the orthogonal decomposition
was introduced in Hoeffding [9]. In the case of samples drawn without replacement, the orthogonal
decompositions of U-statistics of fixed degree $m$ were studied in Zhao and Chen [16]. In the general
case of symmetric statistics (including L-statistics), based on samples drawn without replacement,
the orthogonal decomposition was studied in Bloznelis and Götze [5]. In particular, [5] provides
expressions for the decomposition kernels as follows:

$$g_1(x) = \frac{N-1}{N-n} h_1(x), \qquad (2)$$

$$g_2(x, y) = \frac{N-2}{N-n} \frac{N-3}{N-n-1} \Bigl( h_2(x, y) - \frac{N-1}{N-2} \bigl( h_1(x) + h_1(y) \bigr) \Bigr), \qquad (3)$$

$$g_3(x, y, z) = \frac{N-3}{N-n} \frac{N-4}{N-n-1} \frac{N-5}{N-n-2} \Bigl( h_3(x, y, z) - \frac{N-2}{N-4} \bigl( h_2(x, y) + h_2(x, z) + h_2(y, z) \bigr) + \frac{N-1}{N-3} \frac{N-2}{N-4} \bigl( h_1(x) + h_1(y) + h_1(z) \bigr) \Bigr). \qquad (4)$$

Clearly $g_1(\cdot), g_2(\cdot, \cdot), \dots$ depend on the weight sequence $c_1, \dots, c_n$ and the distribution of $X_1$. The
first issue addressed in this paper is to find a convenient expression of the kernels $g_i$, $i = 1, 2, 3$, in terms
of the weights $c_1, \dots, c_n$. For i.i.d. samples this problem is solved in Putter [10]; see also Putter
and van Zwet [11]. Our purpose is to obtain similar formulas for samples drawn without replacement.
Our result is given in Theorem 2 below.
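For a very small population the kernels in (2) and (3) can be computed by brute force, which gives a quick numerical check of the centering property behind the orthogonality of the decomposition. The following is only a sketch: the function names and the toy data are hypothetical, and the full enumeration of samples is feasible only for tiny $N$.

```python
from itertools import combinations


def l_statistic(sample, weights):
    """L_n = (1/n) * sum_j c_j * X_{j:n}."""
    order_stats = sorted(sample)
    return sum(c * x for c, x in zip(weights, order_stats)) / len(sample)


def kernels_g1_g2(pop, n, weights):
    """Brute-force g_1 and g_2 from (2) and (3); requires 2 <= n <= N - 2."""
    N = len(pop)
    total, count = 0.0, 0
    sum1, cnt1 = [0.0] * N, [0] * N
    sum2, cnt2 = {}, {}
    for idx in combinations(range(N), n):
        value = l_statistic([pop[i] for i in idx], weights)
        total += value
        count += 1
        for i in idx:
            sum1[i] += value
            cnt1[i] += 1
        for pair in combinations(idx, 2):
            sum2[pair] = sum2.get(pair, 0.0) + value
            cnt2[pair] = cnt2.get(pair, 0) + 1
    mean = total / count
    # L_n is symmetric, so conditioning on X_1 = x_k is the same as
    # conditioning on x_k belonging to the sample.
    h1 = [sum1[i] / cnt1[i] - mean for i in range(N)]
    h2 = {p: sum2[p] / cnt2[p] - mean for p in sum2}
    g1 = [(N - 1) / (N - n) * h for h in h1]
    factor = (N - 2) / (N - n) * (N - 3) / (N - n - 1)
    g2 = {
        (k, l): factor * (h2[(k, l)] - (N - 1) / (N - 2) * (h1[k] + h1[l]))
        for (k, l) in h2
    }
    return g1, g2


# Hypothetical toy population (N = 7) and weights (n = 3).
pop = [0.3, 1.1, 1.7, 2.9, 4.0, 4.2, 5.5]
g1, g2 = kernels_g1_g2(pop, 3, [0.2, 1.0, -0.5])
```

With these kernels, $g_1$ sums to zero over the population and, for every fixed $x_k$, $g_2(x_k, \cdot)$ sums to zero over the remaining points, up to rounding error.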

The second issue addressed in this paper is to find specific conditions on $c_1, \dots, c_n$ and the
distribution of $X_1$ that ensure the validity of the one-term Edgeworth expansion. This problem is
important since it is difficult to check the general conditions given in Theorem 2 of Bloznelis and
Götze [5].

Define differences of $c_j$ recursively by

$$\Delta^0(c_j) = c_j, \quad \Delta^1(c_j) = c_j - c_{j-1}, \quad \Delta^v(c_j) = \Delta^1(\Delta^{v-1}(c_j)), \quad v = 2, 3, \dots, n-1.$$

Introduce the smoothness conditions for the weights of an L-statistic as follows: assume that for some
constants $a$, $b$, $c$ and $d$ we have

$$\max_{1 \le j \le n} |\Delta^0(c_j)| \le a, \quad n \max_{2 \le j \le n} |\Delta^1(c_j)| \le b, \quad n^2 \max_{3 \le j \le n} |\Delta^2(c_j)| \le c, \qquad (5)$$

$$n^3 \max_{4 \le j \le n} |\Delta^3(c_j)| \le d. \qquad (6)$$
How can these conditions be checked for an arbitrary sample size $n$? The weights of many important L-statistics,
such as the sample mean, trimmed means and Gini's mean difference, may be written in the following
way: there exists a function $J : (0, 1) \to \mathbb{R}$ such that $c_j = J(j/(n+1))$, $j = 1, \dots, n$. Then condition
(5) is fulfilled if $J(\cdot)$ has a bounded second derivative. Similarly, boundedness of the third derivative
of $J(\cdot)$ implies (6). Thus, we restrict ourselves to the case of smooth weights. For the
sample mean we have $J \equiv 1$, and, in the case of Gini's mean difference, the weights are of the form
$c_j = (n+1) J(j/(n+1))/(n-1)$, $1 \le j \le n$, with $J(u) = 2(2u - 1)$. Clearly, in these cases conditions
(5) and (6) are satisfied. A trimmed mean has the weight function $J(u) = (t_2 - t_1)^{-1} I\{t_1 < u < t_2\}$,
where $0 < t_1 < t_2 < 1$ are trimming parameters. Here $I\{\cdot\}$ is the indicator function. The smoothness
conditions are not fulfilled for this example. For more examples of L-statistics with weights generated
by the function $J(\cdot)$ we refer to Chernoff et al. [6].
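As a rough numerical illustration of how smooth weight functions lead to small weight differences, the sketch below generates $c_j = J(j/(n+1))$ and evaluates the scaled quantities $n^v \max_j |\Delta^v(c_j)|$, $v = 0, 1, 2, 3$. The reading of (5) and (6) as bounds on exactly these scaled quantities is an assumption of the sketch, as are the helper names; the weight function $J(u) = 6u(1-u)$ is the one used in the numerical example of Section 4.

```python
def weights_from_J(J, n):
    """Weights c_j = J(j / (n + 1)), j = 1, ..., n."""
    return [J(j / (n + 1)) for j in range(1, n + 1)]


def difference(c, order):
    """Backward differences of the given order: Delta^v(c_j) for j = v+1, ..., n."""
    d = list(c)
    for _ in range(order):
        d = [d[i] - d[i - 1] for i in range(1, len(d))]
    return d


def scaled_max_differences(c):
    """Return [n^v * max_j |Delta^v(c_j)| for v = 0, 1, 2, 3]; needs n >= 4."""
    n = len(c)
    return [n ** v * max(abs(x) for x in difference(c, v)) for v in range(4)]


def J_quadratic(u):
    """Weight function J(u) = 6u(1 - u)."""
    return 6.0 * u * (1.0 - u)


for n in (10, 50, 200):
    bounds = scaled_max_differences(weights_from_J(J_quadratic, n))
    print(n, [round(b, 4) for b in bounds])
```

For this quadratic $J$ the second differences equal $-12/(n+1)^2$ and the third differences vanish, so the four scaled quantities stay bounded as $n$ grows.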
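Consider the normalized L-statistic

$$S_n = n^{1/2} (L_n - \mathbf{E} L_n). \qquad (7)$$

Write

$$\sigma_n^2 = \mathrm{Var}\, S_n \quad \text{and} \quad \sigma_1^2 = \mathbf{E}\, g_1^2(X_1).$$

Denote $\tau^2 = Npq$, $p = n/N$, $q = 1 - p$ and $n_* = \min\{n, N - n\}$.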

We are going to apply the general result on Edgeworth expansions for symmetric finite population
statistics from Bloznelis and Götze [5]. It follows from Theorem 2 of [5] that

$$G_n(x) = \Phi(x) - \frac{(q - p)\alpha + 3\kappa}{6\tau} \Phi^{(3)}(x)$$

provides the one-term Edgeworth expansion for the distribution function $F_n(x) = \mathbf{P}\{S_n \le x \sigma_n\}$.
Here $\Phi^{(3)}$ stands for the third derivative of the standard normal distribution function $\Phi(x)$, and

$$\alpha = \sigma_1^{-3} \mathbf{E}\, g_1^3(X_1), \qquad \kappa = \sigma_1^{-3} \tau^2 \mathbf{E}\, g_2(X_1, X_2) g_1(X_1) g_1(X_2).$$

Note that, using the explicit formulas for $g_1(\cdot)$ and $g_2(\cdot, \cdot)$ given in Theorem 2 below, one can evaluate $\alpha$,
$\kappa$ and $\sigma_1$. Define the error of the approximation by

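$$\Delta_n := \sup_x |F_n(x) - G_n(x)|.$$

Given $g : \mathbb{R} \to \mathbb{C}$, write $\|g\|_{[a,b]} = \sup_{a < |t| < b} |g(t)|$. Consider the following nonlattice condition: for
every $\varepsilon > 0$ and every $B > 0$, the function $\varphi(t) = \mathbf{E} \exp\{ i t \sigma_1^{-1} g_1(X_1) \}$ satisfies

$$\liminf \|\varphi\|_{[\varepsilon, B]} < 1. \qquad (8)$$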

Consider also the Cramer-type condition 

liminf \\(p\\i^ < 1. (9) 

Asymptotic conditions (8) and (9) are finite population analogues of the nonlattice and Cramér
conditions familiar from the traditional case of i.i.d. observations. They are imposed on the linear
part of the statistic only, as in the case of the usual sample mean. In the i.i.d. case,
the practical validity of the corresponding smoothness conditions for an (infinite) population is largely
a matter of intuition, and in finite population settings one can treat them similarly.
Our main result is the following statement.

Theorem 1. Assume that $\sigma_n$ remains bounded away from zero as $n \to \infty$.

(i) Assume that (5) and (8) hold and that $\mathbf{E} |X_1|^{3+\delta} < \infty$ for some $\delta > 0$. Then $\Delta_n = o(n_*^{-1/2})$
as $n_*, N \to \infty$.

(ii) Assume that (6) and (9) hold and $\mathbf{E} |X_1|^4 < \infty$. Then $\Delta_n = O(n_*^{-1})$ as $n_*, N \to \infty$.

Note that the conditions of Theorem 1 are similar to those used in the i.i.d. situation, see Helmers [8],
Putter [10] and Bentkus et al. [1]. Our proofs are purely combinatorial and differ from those used in
the case of i.i.d. observations. In Section 2 we obtain explicit expressions for the first three terms
of the orthogonal decomposition. In Section 3 we prove Theorem 1. The performance of the Edgeworth
expansion is illustrated by a numerical example in Section 4. Proofs of lemmas are given in Section 5.



2 Orthogonal decomposition

Given $0 \le m \le n$ and $1 \le k_1 < \dots < k_m \le N$, introduce the event

$$\mathcal{A}_m = \mathcal{A}_{x_{k_1}, \dots, x_{k_m}} = \{X_1 = x_{k_1}, \dots, X_m = x_{k_m}\}.$$

For convenience of notation, we define $k_0 := 0$ and $k_{m+1} := N + 1$. Introduce numbers $x_0 := x_1$
and $x_{N+1} := x_N$ and define $X_{0:n} := x_0$ and $X_{n+1:n} := x_{N+1}$, so that, almost surely, we have
$X_{0:n} \le X_{j:n} \le X_{n+1:n}$ for each $1 \le j \le n$. In the proof of Theorem 2 below we represent order
statistics by sums of sample spacings

$$X_{j:n} = \sum_{r=0}^{j-1} \Delta_{r:n} + x_0, \qquad j = 1, \dots, n. \qquad (10)$$

Here $\Delta_{r:n} = X_{r+1:n} - X_{r:n}$, $r = 0, \dots, n$, denote the sample spacings. Write $\Delta_i = x_{i+1} - x_i$, $i = 0, \dots, N$.
Lemma 1. For any $m = 0, \dots, n$ and $r = 0, \dots, n$ we have

$$\mathbf{E}(\Delta_{r:n} \mid \mathcal{A}_m) = \binom{N-m}{n-m}^{-1} \sum_{s=1}^{m+1} \sum_{i=k_{s-1}}^{k_s - 1} \binom{i-s+1}{r-s+1} \binom{N-i-m+s-1}{n-r-m+s-1} \Delta_i. \qquad (11)$$

Denote by $\mathcal{H}_{N,n,i}(j) = \binom{i}{j} \binom{N-i}{n-j} / \binom{N}{n}$ the probability that a hypergeometric random variable with
parameters $N$, $n$ and $i$ attains the value $j$. Denote $[N]_j = N(N-1) \cdots (N-j+1)$. Next we give
explicit and comparatively simple expressions of the kernels (2), (3) and (4).
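The conditional expectation of a sample spacing in Lemma 1 can be checked numerically on a tiny population by enumerating all samples that contain the fixed points. The sketch below assumes distinct population values and reads the right-hand side of (11) as a double sum over $s = 1, \dots, m+1$ and $i = k_{s-1}, \dots, k_s - 1$ with the normalizing factor $\binom{N-m}{n-m}^{-1}$; the function names and the data are hypothetical.

```python
from itertools import combinations
from math import comb


def binom(a, b):
    """Binomial coefficient that is zero outside 0 <= b <= a."""
    return comb(a, b) if 0 <= b <= a else 0


def spacing_formula(x, n, ks, r):
    """Right-hand side of (11); x sorted, ks = 1-based positions k_1 < ... < k_m."""
    N, m = len(x), len(ks)
    xe = [x[0]] + list(x) + [x[-1]]   # xe[i] = x_i with x_0 := x_1, x_{N+1} := x_N
    k = [0] + list(ks) + [N + 1]      # k_0 := 0, k_{m+1} := N + 1
    total = 0.0
    for s in range(1, m + 2):
        for i in range(k[s - 1], k[s]):
            total += (
                binom(i - s + 1, r - s + 1)
                * binom(N - i - m + s - 1, n - r - m + s - 1)
                * (xe[i + 1] - xe[i])
            )
    return total / comb(N - m, n - m)


def spacing_bruteforce(x, n, ks, r):
    """Average of X_{r+1:n} - X_{r:n} over all samples containing x_{k_1}, ..., x_{k_m}."""
    N = len(x)
    fixed = [k - 1 for k in ks]
    rest = [i for i in range(N) if i not in fixed]
    values = []
    for extra in combinations(rest, n - len(ks)):
        s = sorted(x[i] for i in list(extra) + fixed)
        ext = [x[0]] + s + [x[-1]]    # X_{0:n}, X_{1:n}, ..., X_{n:n}, X_{n+1:n}
        values.append(ext[r + 1] - ext[r])
    return sum(values) / len(values)


# Hypothetical sorted population with distinct values (N = 7), sample size n = 4.
x = [0.5, 1.0, 2.5, 3.0, 4.5, 6.0, 6.5]
print(spacing_formula(x, 4, (2, 5), 1), spacing_bruteforce(x, 4, (2, 5), 1))
```

The two functions should agree for every admissible choice of fixed positions and every $r = 0, \dots, n$, which is a convenient sanity check on the index ranges in (11).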

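Theorem 2.

(i) For $1 \le k \le N$,

$$g_1(x_k) = -n^{-1} \sum_{j=1}^{n} c_j \sum_{i=1}^{N-1} \varphi_k(i) \mathcal{H}_{N-2, n-1, i-1}(j-1) \Delta_i, \qquad (12)$$

where

$$\varphi_k(i) = \begin{cases} -i/N & \text{if } 1 \le i < k, \\ 1 - i/N & \text{if } k \le i < N. \end{cases}$$

(ii) For $1 \le k < l \le N$,

$$g_2(x_k, x_l) = -n^{-1} \sum_{j=2}^{n} \Delta^1(c_j) \sum_{i=1}^{N-1} \psi_{k,l}(i) \mathcal{H}_{N-4, n-2, i-2}(j-2) \Delta_i, \qquad (13)$$

where

$$\psi_{k,l}(i) = \begin{cases} i(i-1)/[N-1]_2 & \text{if } 1 \le i < k, \\ -(i-1)(N-i-1)/[N-1]_2 & \text{if } k \le i < l, \\ (N-i-1)(N-i)/[N-1]_2 & \text{if } l \le i < N. \end{cases} \qquad (14)$$

(iii) For $1 \le k < l < m \le N$,

$$g_3(x_k, x_l, x_m) = -n^{-1} \sum_{j=3}^{n} \Delta^2(c_j) \sum_{i=1}^{N-1} \theta_{k,l,m}(i) \mathcal{H}_{N-6, n-3, i-3}(j-3) \Delta_i, \qquad (15)$$

where

$$\theta_{k,l,m}(i) = \begin{cases} -i(i-1)(i-2)/[N-2]_3 & \text{if } 1 \le i < k, \\ (i-1)(i-2)(N-i-2)/[N-2]_3 & \text{if } k \le i < l, \\ -(i-2)(N-i-2)(N-i-1)/[N-2]_3 & \text{if } l \le i < m, \\ (N-i-2)(N-i-1)(N-i)/[N-2]_3 & \text{if } m \le i < N. \end{cases} \qquad (16)$$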
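Part (i) of Theorem 2 can be compared with the definition $g_1 = \frac{N-1}{N-n} h_1$ from (2) on a small population. The sketch below assumes distinct population values and takes $\varphi_k(i) = -i/N$ for $i < k$ and $1 - i/N$ for $i \ge k$; the function names and the toy data are hypothetical.

```python
from itertools import combinations
from math import comb


def l_statistic(sample, weights):
    """L_n = (1/n) * sum_j c_j * X_{j:n}."""
    order_stats = sorted(sample)
    return sum(c * x for c, x in zip(weights, order_stats)) / len(sample)


def hypergeom_pmf(N, n, i, j):
    """P(H = j) for a hypergeometric variable with parameters N, n and i."""
    if j < 0 or j > i or n - j < 0 or n - j > N - i:
        return 0.0
    return comb(i, j) * comb(N - i, n - j) / comb(N, n)


def g1_formula(x, c, k):
    """g_1(x_k) via (12); x is the sorted population, k is 1-based."""
    N, n = len(x), len(c)
    total = 0.0
    for i in range(1, N):
        phi = -i / N if i < k else 1.0 - i / N
        delta_i = x[i] - x[i - 1]          # Delta_i = x_{i+1} - x_i
        inner = sum(
            c[j - 1] * hypergeom_pmf(N - 2, n - 1, i - 1, j - 1)
            for j in range(1, n + 1)
        )
        total += phi * inner * delta_i
    return -total / n


def g1_bruteforce(x, c, k):
    """g_1(x_k) = (N-1)/(N-n) * h_1(x_k), with h_1 obtained by enumeration."""
    N, n = len(x), len(c)
    samples = list(combinations(range(N), n))
    all_vals = [l_statistic([x[i] for i in s], c) for s in samples]
    with_k = [v for s, v in zip(samples, all_vals) if k - 1 in s]
    h1 = sum(with_k) / len(with_k) - sum(all_vals) / len(all_vals)
    return (N - 1) / (N - n) * h1


# Hypothetical population (N = 6) and weights (n = 3).
x = [0.3, 1.1, 1.7, 2.9, 4.0, 4.2]
c = [0.5, 2.0, -1.0]
print([round(g1_formula(x, c, k), 6) for k in range(1, len(x) + 1)])
```

For the sample mean ($c_j = 1$) both computations reduce to $(x_k - \bar{x})/n$, which is a convenient special case for a first check.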



Proof. (i) First we write a kernel of the orthogonal decomposition of the order statistic $X_{j:n}$, $j = 1, \dots, n$.
For chosen $1 \le k \le N$, using representation (10) and Lemma 1 for $m = 0, 1$, we have



9ij{xk) 



N -2 



n 



-1 i-i (k-l 



N-l 



1 E E>(")A.-E 1 



r=0 Ki=l 



i=k 



N 



h2{i,r) Ai } , 



where 



i \r I \ \n — r — 1 I N \ n — r 



and 



722 



{i,r) 



N fN -i\ { fi-l 



N — i\n — rj IVr— 1/ N \r 



n 1 1 



It is easy to verify that $\theta_{21}(i, r) = \theta_{22}(i, r)$. Next, using induction it is easy to show that for every
$j = 1, \dots, n$

'i-l\fN-i-V 
vJ - 1/ V j 



i-i 

E^22(i,r) 

r=0 



and the proof of part (i) follows from the simple observation that

$$g_1(x_k) = n^{-1} \sum_{j=1}^{n} c_j g_{1j}(x_k), \qquad 1 \le k \le N.$$

(ii) Similarly, for chosen $1 \le k < l \le N$, using representation (10) and Lemma 1 for $m = 0, 1, 2$,
for a single order statistic we get



r=0 L i=l i=k 

"t:^ [N -i){N -i-l)^ , 1 



7 L I 

1=1 ) 

where 

[^^ - 1]2 /A f /iV - « - 2\ n-\(N-i-\\ n{n-l)fN-i 



and 



732 



'^^''"^ 1) VrJ I Vn-r-2j ^iV - 2 - r - 1 J ^ [A^ - 1]2 - 

[iV - 1]2 



(i,r) 



(i - 1)(7V -i-l) 



i-l\\/N-i-l\ n-lfN-i 



r — Ij I \n — r — 1 J N — 2 \n — r 
n — 1 f i\ { f N — i — 1\ n f N — i 



N — 2\r J \ \n — r — 1 J N — 1 \n — r 



and 



,33(, .) =^-Kzn2 /iV - A f /i - /i - 1 V „(„ - 1) fi 



{N -i){N -i-l)\n-r J {\r -2j N-2\r-lJ [N-l]2\r 
Similarly, $\theta_{31}(i, r) = \theta_{32}(i, r) = \theta_{33}(i, r)$. Next, using induction one can show that, for every
$j = 1, \dots, n$,



'i-2\/N-i-2\ fi-2\fN-i-2 



E^33(i,^ 



r=0 



j - IJ \n- j - IJ \j -2J \ n- j 



To complete the proof of part (ii) we observe that

$$g_2(x_k, x_l) = n^{-1} \sum_{j=1}^{n} c_j g_{2j}(x_k, x_l), \qquad 1 \le k < l \le N,$$

and apply

$$\sum_{j=1}^{n} c_j (b_{j+1} - b_j) = c_n b_{n+1} - c_1 b_1 - \sum_{j=2}^{n} (c_j - c_{j-1}) b_j, \qquad \text{where } b_j = \binom{i-2}{j-2} \binom{N-i-2}{n-j}.$$

(iii) The proof of this part is very similar. □

3 Edgeworth expansion

Decomposition (1) is a stochastic expansion of an L-statistic, and the first few terms of the decomposition
can provide a very good approximation to $L_n$. In order to control the accuracy of the approximation,
one can use the smoothness conditions defined below.
Let $(X_1, \dots, X_N)$ denote a random permutation of the ordered set $(x_1, \dots, x_N)$ which is uniformly
distributed over the class of permutations. Then the first $n$ observations $X_1, \dots, X_n$ represent a
simple random sample from the population $\mathcal{X}$. For $j = 1, \dots, N - n$ denote $X'_j = X_{n+j}$. Define

$$D^j L_n = L_n(X_1, \dots, X_n) - L_n(X_1, \dots, X_{j-1}, X_{j+1}, \dots, X_n, X'_j).$$

Higher order difference operations are defined recursively:

$$D^{j_1 j_2} L_n = D^{j_2}(D^{j_1} L_n), \qquad D^{j_1 j_2 j_3} L_n = D^{j_3}(D^{j_2}(D^{j_1} L_n)), \qquad \dots$$

They are symmetric; that is, $D^{j_1 j_2} L_n = D^{j_2 j_1} L_n$, etc. Write

$$\delta_k = \delta_k(L_n) = \mathbf{E} \bigl( n_*^{k-1} \mathbb{D}_k L_n \bigr)^2, \qquad \mathbb{D}_k L_n = D^{12 \dots k} L_n, \qquad 1 \le k \le n_*.$$

We shall write the differences $\mathbb{D}_k L_n$, $k = 1, 2, 3, 4$, in terms of differences of the weights $c_1, \dots, c_n$.
For that purpose we need additional notation. For $k = 1, 2, 3, 4$, denote by $X_{1:n+k} \le \dots \le X_{n+k:n+k}$
the order statistics which correspond to the sample $\mathbb{X}_k = \{X_1, \dots, X_{n+k}\}$. Let $\mathbb{R}_k = \{R_1, \dots, R_{n+k}\}$
be the ranks of the sample $\mathbb{X}_k$, assigned as follows: introduce the function $f : \mathcal{X} \to \{1, \dots, N\}$ given
by $f(x_k) = k$, $k = 1, \dots, N$. Then we decide that $R_i < R_j$ if $f(X_i) < f(X_j)$, and so on. Thus the
ranks $\mathbb{R}_k$ are all distinct, i.e., $\mathbb{R}_k$ is a random permutation of the set $\{1, \dots, n+k\}$. Further, denote
$\mathbb{R}'_k = \{R_1, \dots, R_k, R_{n+1}, \dots, R_{n+k}\}$ and let $\mathcal{R}_k$ be the set of all permutations of the set $\mathbb{R}'_k$, where a
particular permutation means the arrangement of elements of the set by size. Let
$R_{1:2k} < \dots < R_{k:2k} < R_{n+1:2k} < \dots < R_{n+k:2k}$ denote the order statistics which correspond to $\mathbb{R}'_k$.

Lemma 2. For each $k = 1, 2, 3, 4$ there exists a random variable $d_k = d_k(\mathbb{R}'_k)$ with values in $\{-1, 0, 1\}$
such that

$$\mathbb{D}_k L_n = d_k n^{-1} \sum_{j = R_{k:2k}}^{R_{n+1:2k} - 1} \Delta^{k-1}(c_j) \Delta_{j:n+k}.$$

Consider further the normalized L-statistic $S_n$, defined in (7). It will be seen from the proof of
Theorem 1 below how conditions (5) and (6) for the weight sequence $c_1, \dots, c_n$ (together with an
appropriate moment condition for $X_1$) correspond to the smoothness conditions

$$\delta_k(S_n) = O(n_*^{-1}) \quad \text{as } n_*, N \to \infty,$$

for $k = 3, 4$.

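For the statistic $S_n$, from Theorem 2 we have kernels $g_1$, $g_2$ and $g_3$ which differ from (12), (13)
and (15) only by the multiplier $n^{1/2}$. Write

$$\beta_s = \mathbf{E} \bigl| n_*^{1/2} g_1(X_1) \bigr|^s, \qquad \gamma_s = \mathbf{E} \bigl| n_*^{3/2} g_2(X_1, X_2) \bigr|^s, \qquad \zeta_s = \mathbf{E} \bigl| n_*^{5/2} g_3(X_1, X_2, X_3) \bigr|^s.$$

Theorem 1 is a consequence of the following theorem.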

Theorem BG (Theorem 2 of Bloznelis and Götze [5]). Assume that $\sigma_n$ remains bounded away from
zero as $n \to \infty$.

(i) Assume that (8) holds, $\delta_3 = o(n_*^{-1/2})$ and, for some $\delta > 0$, the moments $\beta_{3+\delta}$ and $\gamma_{2+\delta}$ are
bounded as $n_*, N \to \infty$. Then $\Delta_n = o(n_*^{-1/2})$ as $n_*, N \to \infty$.

(ii) Assume that (9) holds, $\delta_4 = O(n_*^{-1})$ and the moments $\beta_4$, $\gamma_4$, $\zeta_2$ are bounded as $n_*, N \to \infty$.
Then $\Delta_n = O(n_*^{-1})$ as $n_*, N \to \infty$.
Let us turn to the proof of our main result.
Proof of Theorem 1. Following van Zwet [15] we introduce the functions

$$G(x) = \int_{-\infty}^{x} F(y)\, dy, \qquad H(x) = \int_{x}^{\infty} (1 - F(y))\, dy, \qquad M(x) = \int_{-\infty}^{x} F(y)(1 - F(y))\, dy, \qquad (17)$$

where

$$F(y) = \frac{1}{N} \sum_{i=1}^{N} I\{x_i \le y\} \qquad (18)$$

is the distribution function of the random variable $X_1$.

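A simple integration of (18) and some work with sums yield that at the points $x = x_k$, $k = 1, \dots, N$,
we have

$$G(x_k) = \sum_{i=1}^{k-1} \frac{i}{N} \Delta_i, \qquad H(x_k) = \sum_{i=k}^{N-1} \Bigl( 1 - \frac{i}{N} \Bigr) \Delta_i, \qquad M(x_k) = \sum_{i=1}^{k-1} \frac{i}{N} \Bigl( 1 - \frac{i}{N} \Bigr) \Delta_i.$$

Functions (17) are finite and monotone, and, similarly as in Putter [10], we obtain for $k = 1, \dots, N$

$$G(x_k) + H(x_k) \le \mathbf{E} |X_1| + |x_k| \qquad (19)$$

and

$$M(x_N) \le G(x_k) + H(x_k) \le \mathbf{E} |X_1| + |x_k|. \qquad (20)$$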
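The functions in (17) are easy to evaluate at population points, and inequalities (19) and (20) can be checked directly. The sketch below assumes a sorted population with distinct values, so that $F$ equals $i/N$ between $x_i$ and $x_{i+1}$; the function name and the data are hypothetical. Under this assumption $G(x_k) + H(x_k)$ coincides with $\mathbf{E} |X_1 - x_k|$, which is where the bound in (19) comes from.

```python
def G_H_M_at(x, k):
    """Return (G(x_k), H(x_k), M(x_k)) for a sorted population x; k is 1-based."""
    N = len(x)
    delta = [x[i] - x[i - 1] for i in range(1, N)]   # delta[i-1] = Delta_i
    G = sum(i / N * delta[i - 1] for i in range(1, k))
    H = sum((1.0 - i / N) * delta[i - 1] for i in range(k, N))
    M = sum(i / N * (1.0 - i / N) * delta[i - 1] for i in range(1, k))
    return G, H, M


# Hypothetical population (N = 5).
x = [-2.0, -0.5, 0.1, 1.3, 4.0]
N = len(x)
mean_abs = sum(abs(v) for v in x) / N                # E|X_1|
M_top = G_H_M_at(x, N)[2]                            # M(x_N)
for k in range(1, N + 1):
    G, H, M = G_H_M_at(x, k)
    print(k, round(G + H, 4), round(mean_abs + abs(x[k - 1]), 4), round(M_top, 4))
```

Each printed row shows $G(x_k) + H(x_k)$, the bound $\mathbf{E}|X_1| + |x_k|$ and the constant $M(x_N)$, so both inequalities can be read off at a glance.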

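One can show (we omit a detailed proof) that the quantities $\psi_{k,l}(i)$ and $\theta_{k,l,m}(i)$, defined by (14) and
(16), satisfy

$$|\psi_{k,l}(i)| \le \Bigl( \frac{N}{N-2} \Bigr)^2 |\psi^0_{k,l}(i)| \le 4 |\psi^0_{k,l}(i)| \qquad (21)$$

and

$$|\theta_{k,l,m}(i)| \le \Bigl( \frac{N}{N-4} \Bigr)^3 |\theta^0_{k,l,m}(i)| \le 27 |\theta^0_{k,l,m}(i)|, \qquad (22)$$

where we write

$$\psi^0_{k,l}(i) = (I\{i \ge k\} - i/N)(I\{i \ge l\} - i/N)$$

and

$$\theta^0_{k,l,m}(i) = (I\{i \ge k\} - i/N)(I\{i \ge l\} - i/N)(I\{i \ge m\} - i/N).$$

Then, by (5), (12) and because $\sum_{j=1}^{n} \mathcal{H}_{N-2,n-1,i-1}(j-1) = 1$, for $1 \le k \le N$,

$$|g_1(x_k)| \le a n^{-1/2} \sum_{i=1}^{N-1} |\varphi_k(i)| \Delta_i = a n^{-1/2} [G(x_k) + H(x_k)] \le a n^{-1/2} [\mathbf{E} |X_1| + |x_k|],$$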
and thus for s > 1,

N . N 



E \9liX^)r = ^ ^ \gi{x,)f < a^-t^ [E \X,\ + 



k-i ^ k-i ^23) 

fc=i 



Also, by jS]), 121]), ([H]), ([20]), because of monotonicity of G, H, M and Ei=2 '^A^-4,n-2,i-2(i-2) 
1, for 1 < /c < / < iV, 



iV-l Af-l 
3 . . 3 



\92{xk,xi)\ <bn 2 ^\(j)k,iii)\ Ai< Abn 2 ^{(p^ 



i=l i=l 

< 45n~i [G{xi) + M{xn) + H{xi)] < 8bn-^ [E \Xi\ + , 



and then for s > 1, 



^ ^ l<k<l<N ^ ^ l<fc<i<Ar 

< 23^6^n~i^('^') ^ 2«-i[(E|Xi|)^ + |x/|'] <24^+l6%-i^E|Xl|^ 

^ ^ l<k<l<N 



Similarly, by (P, ([221), ([T9]), ([20]), because of monotonicity of G, H, M and E"=3^Af-6,n-3,i-3(j- 
3) = 1, for 1 < A; < / < m < iV, 

\g3{xk,xi,Xm)\ < ^'^cn~2 [E \Xi\ + |xm|] , 

and then for s > 1, 

E\g3{Xi,X2,X3)\' <2'+^3^'c'n--2'B\Xi\\ (25) 

Note that it follows from inequalities (23), (24) and (25) that, in both cases (i) and (ii), the moments
$\beta_s$, $\gamma_s$ and $\zeta_s$ in Theorem BG are bounded if the corresponding moments $\mathbf{E} |X_1|^s$ are finite.

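Next we evaluate the quantities $\delta_3(S_n)$ and $\delta_4(S_n)$. Let us introduce the event
$\mathcal{A}_{k;ij} = \{R_{k:2k} = i, R_{n+1:2k} = j\}$. Application of Lemma 2 and the use of the corresponding smoothness conditions (5)
and (6) for $k = 3, 4$ yield

$$\mathbf{E} \bigl[ (n_*^{k-1} \mathbb{D}_k S_n)^2 \mid \mathcal{A}_{k;ij} \bigr] \le C_1 n^{-1} \mathbf{E} \bigl[ (X_{j:n+k} - X_{i:n+k})^2 \mid \mathcal{A}_{k;ij} \bigr], \qquad (26)$$

where we denote $C_1 = \max\{c^2, d^2\}$. It is a simple problem to calculate that

$$P_{k;ij} = \mathbf{P}\{\mathcal{A}_{k;ij}\} = \binom{i-1}{k-1} \binom{n+k-j}{k-1} \Big/ \binom{n+k}{2k},$$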

where k<i<j<n + \. Since the events '^k\ij and ^k;ijim = {^i-.n+k = xi,Xj.n+k = Xm} are 
independent, for xi < • • • < x^ we have 

Pk,,lm = F{^k,,lm\^k,,}=(^~_'^ (7-^-l) {nVk-j)/{n + k) ' 






where i<l<m<N — n — k + j and m — I > j — i. For xi < ■ ■ ■ < xn these probabihties are the 
same. It follows from the same argument as in the proof of Lemma [TJ 
Then, we derive from (I26p 

-xif, where Xk-im = ^ Pk;ijPk;ijlm- 
l<l<m<N k<i<j<n+l 



It is easy to check that 



^1 = ^ Yl (^"^ - ^')' 



l<l<m<N 



Clearly, to prove the bounds 6k{Sn) = 0{n^ ^), k = 3,4, it will now suffice to show that Xk;im 
0(iV-2) for all 1 < / < m < iV. 

Using the simple inequality (") < u" /v\ for all /c < i < j < ?i + 1, we obtain 



i-l\(n + k-i\ ^ (i-l)"-^ {n + k-jr-^ ^ [{i - l){n + k - 1 - i)f 



k-ij\ k-1 J - (k-iy. (k-iy. - [{k-iy.] 



2 



n^k-2\ n+k-2 



.ly (n + - 2)2(fe-i) 



Also, for ah 1 < / < m < iV 

<7<n+l ^ ^ s=fc-l i=0 



/m-l-l\/ N -m 

t )\n + k-2-s-t 



j.r \ -1 n+k~2n+k~2-s . ^ / , x ^ 



fc<i<j< 

< 



n + kj \n + k — 2 

In the last step here the generalized Vandermonde identity was applied. 
Finally, from (j27p and (I28p we derive 



n + kl \ s /V i /Vf^ + fe — 2 — s — i 

s=0 t=o 

iV N-2 



(27) 



(28) 



2k ) \A) [(fc-l)!]2 \n + kj \n + k-2 

for all 1 < / < m < A^, where 

(2A;)! (2fc-2)2'=-3 

^"-'^[ij p-i)!]2 (2k -sy. ' 

Application of Theorem BG completes the proof of both cases (i) and (ii) of the theorem. □ 
4 An example

We show, by the following simple example, how the Edgeworth expansion improves the usual normal
approximation. A population $\mathcal{X}$ of size $N = 100$ was simulated from the logistic distribution with the
distribution function $H(x) = (1 + e^{-x})^{-1}$ for $-\infty < x < +\infty$. Our chosen population $\mathcal{X}$ has mean
0.004 and variance 3.270. Consider the L-statistic with the weights $c_j = J(j/(n+1))$, $1 \le j \le n$,
where $J(u) = 6u(1 - u)$. As in the case of i.i.d. observations (see Chernoff et al. [6]), this
statistic may be applied to estimate the center of the population.

For samples of sizes $n = 5, 15, 30$, we present several $q$-quantiles of the functions $\hat{F}_n$, $G_n$ and $\Phi$ in
Table 1 below. Here $\hat{F}_n$ is the Monte-Carlo approximation of $F_n$, obtained by drawing (independently)
10^ samples without replacement from X . 
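A simulation of this kind can be sketched as follows. This is not the authors' code: the seed, the number of replications `reps`, the function names and the use of the Monte-Carlo mean and standard deviation in place of the exact $\mathbf{E} L_n$ and $\sigma_n$ are all assumptions of the sketch, so the resulting numbers will not reproduce the table exactly.

```python
import math
import random


def l_statistic(sample, weights):
    """L_n = (1/n) * sum_j c_j * X_{j:n}."""
    order_stats = sorted(sample)
    return sum(c * x for c, x in zip(weights, order_stats)) / len(sample)


def logistic_population(N, rng):
    """Draw N values from the logistic law H(x) = 1 / (1 + exp(-x)) by inversion."""
    pop = []
    for _ in range(N):
        u = rng.uniform(1e-12, 1.0 - 1e-12)   # keep u away from 0 and 1
        pop.append(math.log(u / (1.0 - u)))
    return sorted(pop)


def monte_carlo_quantiles(population, n, reps, probs, rng):
    """Empirical q-quantiles of the standardized L-statistic with J(u) = 6u(1-u)."""
    c = [6.0 * (j / (n + 1)) * (1.0 - j / (n + 1)) for j in range(1, n + 1)]
    values = [l_statistic(rng.sample(population, n), c) for _ in range(reps)]
    mean = sum(values) / reps
    sd = math.sqrt(sum((v - mean) ** 2 for v in values) / reps)
    z = sorted((v - mean) / sd for v in values)
    return [z[min(reps - 1, int(q * reps))] for q in probs]


rng = random.Random(2024)                       # hypothetical seed
population = logistic_population(100, rng)      # N = 100 as in the text
quantiles = monte_carlo_quantiles(population, 5, 2000, [0.05, 0.50, 0.95], rng)
print([round(q, 3) for q in quantiles])
```

Increasing `reps` reduces the Monte-Carlo error of the estimated quantiles at the cost of run time.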



Table 1: Approximations to $F_n$, $n = 5, 15, 30$

q                          0.01     0.05     0.10     0.25     0.50     0.75     0.90     0.95     0.99
$\hat{F}_5^{-1}(q)$      -2.120   -1.534   -1.224   -0.692   -0.063    0.633    1.319    1.750    2.581
$G_5^{-1}(q)$            -2.078   -1.557   -1.249   -0.704   -0.058    0.639    1.323    1.754    2.563
$\hat{F}_{15}^{-1}(q)$   -2.159   -1.577   -1.253   -0.694   -0.040    0.653    1.308    1.712    2.489
$G_{15}^{-1}(q)$         -2.160   -1.585   -1.259   -0.695   -0.039    0.652    1.308    1.714    2.487
$\hat{F}_{30}^{-1}(q)$   -2.199   -1.604   -1.268   -0.689   -0.026    0.663    1.299    1.688    2.428
$G_{30}^{-1}(q)$         -2.220   -1.606   -1.267   -0.688   -0.025    0.660    1.297    1.687    2.430
$\Phi^{-1}(q)$           -2.326   -1.645   -1.282   -0.674    0.000    0.674    1.282    1.645    2.326



Table 1 shows that, even for a small sample of size $n = 5$, the Edgeworth expansion is much more
accurate than the normal approximation. Note that, in practice, the population parameters $\alpha$ and $\kappa$,
which define the Edgeworth expansion $G_n$, should be estimated; see, e.g., Bloznelis [4].



5 Proofs of lemmas 



Proof of Lemma 1. Assume that $x_1 < \dots < x_N$. Then, for any $m = 0, \dots, n$ and $r = 0, \dots, n$,
straightforward combinatorial calculations give



N — m 
n — m 



m+l fcs — 1 

E E 

s=l i=kg-i+l 



i — s\ fN — i — m + s — 1 



r — sj\n — r — m + s — 1 



E"^"*"^ f kg — s\ f N — ks — m + 
\ r — s J \ n — r — m + s 

s=0 



The key idea is for r = 0, . . . , n to note that 



N — m 
n — m 



m+l fcs — 1 

E E 



I — s 
r — s + 1 



m+l 

+ E 

s=Q 



1 i = ks-l+l 

N — kg — m + s 



kfi 



r — s + 1] \n — r — m + s — 1 
where



and 



.(0 



N — i — m + s 
n — r — m + s — 1 



N — i — m + s — 1 
n — r — m + s — 1 



E {Xr:n \ ^m) 



N — m 
n — m 



m+l ks — 1 
s=l i=ks-i+l 



N — i — m + s — 1 
n — r — m + s — 1 



m+l 

■E 

s=0 



ha 



N — kg — m + s 



r — s I \ n — r — m + s 



Xi. 



where 



(r) 



i- s+l 
r — s + I 



I — s 
r — s + 1 



Then, it is easy to verify that, for $r = 0, \dots, n$, $\mathbf{E}(\Delta_{r:n} \mid \mathcal{A}_m)$ is the same as in (11). For $x_1 \le \dots \le x_N$
this result does not change if we assume (without loss of generality) that coincident values of $\mathcal{X}$ are
strictly ordered by their unique names, e.g., by the sizes of their indices. □

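Proof of Lemma 2. The set $\mathcal{R}_1$ contains only 2 elements, therefore we elaborate the case $k = 1$.
Let $R_1 < R_{n+1}$. Then

$$n \mathbb{D}_1 L_n = n \bigl( L_n(\mathbb{X}_1 \setminus \{X_{n+1}\}) - L_n(\mathbb{X}_1 \setminus \{X_1\}) \bigr)$$

$$= \Bigl[ \sum_{j=1}^{R_{n+1}-1} c_j X_{j:n+1} + \sum_{j=R_{n+1}}^{n} c_j X_{j+1:n+1} \Bigr] - \Bigl[ \sum_{j=1}^{R_1-1} c_j X_{j:n+1} + \sum_{j=R_1}^{n} c_j X_{j+1:n+1} \Bigr]$$

$$= - \sum_{j=R_1}^{R_{n+1}-1} c_j \Delta_{j:n+1}.$$

Let $R_1 > R_{n+1}$. Then

$$n \mathbb{D}_1 L_n = n \bigl( L_n(\mathbb{X}_1 \setminus \{X_{n+1}\}) - L_n(\mathbb{X}_1 \setminus \{X_1\}) \bigr)$$

$$= \Bigl[ \sum_{j=1}^{R_{n+1}-1} c_j X_{j:n+1} + \sum_{j=R_{n+1}}^{n} c_j X_{j+1:n+1} \Bigr] - \Bigl[ \sum_{j=1}^{R_1-1} c_j X_{j:n+1} + \sum_{j=R_1}^{n} c_j X_{j+1:n+1} \Bigr]$$

$$= \sum_{j=R_{n+1}}^{R_1-1} c_j \Delta_{j:n+1}.$$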
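The case $k = 1$ worked out above is simple enough to check numerically: replacing $X_1$ by $X_{n+1}$ changes $L_n$ by a signed, weighted sum of the spacings of the pooled sample between the two ranks. The sketch below assumes distinct values; the function names and the random test data are hypothetical.

```python
import random


def l_statistic(sample, weights):
    """L_n = (1/n) * sum_j c_j * X_{j:n}."""
    order_stats = sorted(sample)
    return sum(c * x for c, x in zip(weights, order_stats)) / len(sample)


def d1_direct(xs, weights):
    """L_n(X_1, ..., X_n) - L_n(X_{n+1}, X_2, ..., X_n) for xs = (X_1, ..., X_{n+1})."""
    n = len(xs) - 1
    return l_statistic(xs[:n], weights) - l_statistic([xs[n]] + xs[1:n], weights)


def d1_formula(xs, weights):
    """Same difference written through spacings of the pooled sample of size n + 1."""
    n = len(xs) - 1
    order = sorted(xs)                       # X_{1:n+1} <= ... <= X_{n+1:n+1}
    r1 = order.index(xs[0]) + 1              # rank R_1 of X_1
    rn1 = order.index(xs[n]) + 1             # rank R_{n+1} of X_{n+1}
    lo, hi = min(r1, rn1), max(r1, rn1)
    sign = -1.0 if r1 < rn1 else 1.0
    # Delta_{j:n+1} = X_{j+1:n+1} - X_{j:n+1} = order[j] - order[j-1]
    total = sum(weights[j - 1] * (order[j] - order[j - 1]) for j in range(lo, hi))
    return sign * total / n


rng = random.Random(1)
xs = [rng.random() for _ in range(7)]         # n = 6 plus one extra observation
w = [rng.uniform(-1.0, 1.0) for _ in range(6)]
print(d1_direct(xs, w), d1_formula(xs, w))
```

The two printed values should coincide up to rounding, with the sign determined by which of the two exchanged observations has the smaller rank.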



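Next we calculate $\mathbb{D}_k L_n$, $k = 2, 3, 4$, recursively. Write $\mathbb{D}_2 L_n = \mathbb{D}'_1 L_n - \mathbb{D}''_1 L_n$, where

$$\mathbb{D}'_1 L_n = L_n(\mathbb{X}_2 \setminus \{X_{n+1}, X_{n+2}\}) - L_n(\mathbb{X}_2 \setminus \{X_1, X_{n+2}\}),$$
$$\mathbb{D}''_1 L_n = L_n(\mathbb{X}_2 \setminus \{X_2, X_{n+1}\}) - L_n(\mathbb{X}_2 \setminus \{X_1, X_2\}).$$

Note that both components of $\mathbb{D}'_1 L_n$ are dependent on $X_2$ and are independent of $X_{n+2}$. For $\mathbb{D}''_1 L_n$
the converse holds. Now by constructing the set $\mathcal{R}_2$ from the set $\mathcal{R}_1$ we find the following cases.
For $R_1 < R_{n+1}$,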
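$$n \mathbb{D}'_1 L_n = \begin{cases}
- \sum_{j=R_1}^{R_{n+1}-1} c_{j-1} \Delta_{j:n+2} & \text{for } R_{n+2} < R_1, \\
- \sum_{j=R_1}^{R_{n+2}-1} c_j \Delta_{j:n+2} - \sum_{j=R_{n+2}}^{R_{n+1}-1} c_{j-1} \Delta_{j:n+2} & \text{for } R_1 < R_{n+2} < R_{n+1}, \\
- \sum_{j=R_1}^{R_{n+1}-1} c_j \Delta_{j:n+2} & \text{for } R_{n+1} < R_{n+2},
\end{cases}$$

$$n \mathbb{D}''_1 L_n = \begin{cases}
- \sum_{j=R_1}^{R_{n+1}-1} c_{j-1} \Delta_{j:n+2} & \text{for } R_2 < R_1, \\
- \sum_{j=R_1}^{R_2-1} c_j \Delta_{j:n+2} - \sum_{j=R_2}^{R_{n+1}-1} c_{j-1} \Delta_{j:n+2} & \text{for } R_1 < R_2 < R_{n+1}, \\
- \sum_{j=R_1}^{R_{n+1}-1} c_j \Delta_{j:n+2} & \text{for } R_{n+1} < R_2.
\end{cases}$$

For $R_1 > R_{n+1}$,

$$n \mathbb{D}'_1 L_n = \begin{cases}
\sum_{j=R_{n+1}}^{R_1-1} c_{j-1} \Delta_{j:n+2} & \text{for } R_{n+2} < R_{n+1}, \\
\sum_{j=R_{n+1}}^{R_{n+2}-1} c_j \Delta_{j:n+2} + \sum_{j=R_{n+2}}^{R_1-1} c_{j-1} \Delta_{j:n+2} & \text{for } R_{n+1} < R_{n+2} < R_1, \\
\sum_{j=R_{n+1}}^{R_1-1} c_j \Delta_{j:n+2} & \text{for } R_1 < R_{n+2},
\end{cases}$$

$$n \mathbb{D}''_1 L_n = \begin{cases}
\sum_{j=R_{n+1}}^{R_1-1} c_{j-1} \Delta_{j:n+2} & \text{for } R_2 < R_{n+1}, \\
\sum_{j=R_{n+1}}^{R_2-1} c_j \Delta_{j:n+2} + \sum_{j=R_2}^{R_1-1} c_{j-1} \Delta_{j:n+2} & \text{for } R_{n+1} < R_2 < R_1, \\
\sum_{j=R_{n+1}}^{R_1-1} c_j \Delta_{j:n+2} & \text{for } R_1 < R_2.
\end{cases}$$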

Note that it suffices to consider only those elements of $\mathcal{R}_2$ for which we have $R_1 > R_2$.
Similarly, write $\mathbb{D}_3 L_n = \mathbb{D}'_2 L_n - \mathbb{D}''_2 L_n$, where

$$\mathbb{D}'_2 L_n = L_n(\mathbb{X}_3 \setminus \{X_{n+1}, X_{n+2}, X_{n+3}\}) - L_n(\mathbb{X}_3 \setminus \{X_2, X_{n+1}, X_{n+3}\}) - L_n(\mathbb{X}_3 \setminus \{X_1, X_{n+2}, X_{n+3}\}) + L_n(\mathbb{X}_3 \setminus \{X_1, X_2, X_{n+3}\}),$$

$$\mathbb{D}''_2 L_n = L_n(\mathbb{X}_3 \setminus \{X_3, X_{n+1}, X_{n+2}\}) - L_n(\mathbb{X}_3 \setminus \{X_2, X_3, X_{n+1}\}) - L_n(\mathbb{X}_3 \setminus \{X_1, X_3, X_{n+2}\}) + L_n(\mathbb{X}_3 \setminus \{X_1, X_2, X_3\}).$$

Note that all the components of $\mathbb{D}'_2 L_n$ are dependent on $X_3$ and are independent of $X_{n+3}$. For $\mathbb{D}''_2 L_n$
the converse holds. Now, by constructing the set $\mathcal{R}_3$ from the set $\mathcal{R}_2$, similarly as in the case $k = 2$,
we can recursively find $\mathbb{D}_3 L_n$. Without loss of generality, we consider only those elements of $\mathcal{R}_3$ for
which $R_1 > R_2 > R_3$. The calculations are routine, long and cumbersome; therefore we omit them.

The calculation of $\mathbb{D}_4 L_n$ is similar. Assuming that $R_1 > R_2 > R_3 > R_4$, it suffices to consider
(only) $8!/4!$ elements of the set $\mathcal{R}_4$.

□